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DETAILED ACTION 

1 . This action is issued in response to the Amendment filed on 07/27/2006. 

2. Claims 1, 3, 5, 7, 9, 12, 15, 16, and 17 were amended. Claims 4, and 10 were 
canceled. Claims 18-20 were added. 

3. Claims 1, 3, 5, 7, 9, 12, and 15 - 20 are pending in this application. 

4. Applicant's arguments with respect to claims 1, 3, 5, 7, 9, 12, 15, 16, and 17 have 
been considered but are moot in view of the new ground(s) of rejection. 



Terminal Disclaimer 

5. The Terminal Disclaimer filed has been received and acknowledged. However, 
since the rejection was based on a statutory (35 U.S.C. 101) double patenting, the 
terminal disclaimer cannot overcome the double patenting rejection. 

Claim Rejections - 35 USC § 101 

6. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

7. Claims 1, 3, and 5 are rejected under 35 U.S.C. 101 because: 

The claims lack the necessary physical articles or objects to constitute a machine 
or a manufacture within the meaning of 35 USC 101. They are clearly not a series of 
steps or acts to be a process nor are they a combination of chemical compounds to be 
a composition of matter. As such, they fail to fall within a statutory category. They are, at 
best, functional descriptive material perse. 
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Descriptive material can be characterized as either "functional descriptive 
material" or "nonfunctional descriptive material." Both types of "descriptive material" are 
nonstatutory when claimed as descriptive material perse, 33 F.3d at 1360, 31 USPQ2d 
at 1759. When functional descriptive material is recorded on some computer-readable 
medium, it becomes structurally and functionally interrelated to the medium and will be 
statutory in most cases since use of technology permits the function of the descriptive 
material to be realized. Compare In re Lowry, 32 F.3d 1579, 1583-84, 32 USPQ2d 
1031, 1035 (Fed. Cir. 1994) 

Claim 1 merely claims nonfunctional descriptive material that does not have a 
useful result, which does not make it statutory. See Diehr, 450 U.S. at 185-86, 209 
USPQ at 8 (noting that the claims for an algorithm in Benson were unpatentable as 
abstract ideas because "[t]he sole practical application of the algorithm was in 
connection with the programming of a general purpose computer."). 



Claim Rejections - 35 USC § 102 

8. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 
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9. Claims 1, 3, 5, 7, 9, 12, and 15-20 are rejected under 35 U.S.C. 102(e) as 
being anticipated by Alam et al. (Alam hereinafter) (US Patent No. 6,336,124 B1, filed: 
October 1, 1998). 

Regarding Claim 1 and 7, Alam discloses a computer-readable medium having 
stored thereon sequences of instructions, said sequences of instructions including 
instructions which, when executed by a processor, cause said processor to perform the 
steps of: 

receiving a page description language representation of the document (Col. 6, 
lines 24 - 28, Alam) for providing a list of words in the document (Col. 6, lines 57 - 59, 
words, Alam) and position information for the words (Col. 6 and 7, lines 54 - 57 and 60 
- 63; respectively, X and Y coordinates, Alam); and 

automatically identifying table data in the document based on the page 
description language representation of the document and at least one table identifying 
feature (Col. 7, lines 26 - 28 and 32 - 35, Alam), wherein the identifying step includes: 
dividing the document into one or more pages (Col. 15, lines 56 - 58, 
dividing into sub-pages, Alam); 

dividing each page into a plurality of lines (Col. 15, lines 57 - 60, Alam); 
for each line (Col. 8, lines 57 - 60, Alam), clustering the words of the line 
into one or more word clusters (Col. 9, lines 1-6, Alam 1 ), wherein each cluster 
includes one or more words (Col. 16, lines 15-22; if the block is a paragraph 
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that ends with an incomplete sentence or an improper termination ... whether 
the block contains one or more sentences, Alam), each cluster having a 
horizontal beginning point, horizontal midpoint, and horizontal end point (Col. 11, 
lines 8-17, wherein the right most X coordinate corresponds to the horizontal 
beginning point claimed; wherein the center X coordinate corresponds to the 
horizontal midpoint claimed; and wherein the left X coordinate corresponds to the 
horizontal end point claimed; Alam); 

automatically identifying table data in the document based on the number 
of word clusters for each line and the alignment of the horizontal beginning 
points, horizontal midpoints, and horizontal end point of word clusters between 
lines (Col. 1 1 , and 12, lines 8-17, and 47 - 49 and 51 - 52; respectively, Alam). 

Regarding Claim 3, Alam discloses a method wherein the step of automatically 
identifying table data in the document based on the number of word clusters for each 
line and the alignment of the word clusters between lines (Col. 12, lines 47 - 49 and 51 
- 52, Alam) further comprises: 

using the word clusters to generate column position information (Col. 9 and 18, 
lines 10-12 and 9-12; respectively, Alam), wherein the column information includes 
for each column a horizontal beginning point, horizontal midpoint, and horizontal end 
point (Col. 1 1 , lines 8-17, wherein the right most X coordinate corresponds to the 
horizontal beginning point claimed; wherein the center X coordinate corresponds to the 



1 Wherein examiner interprets the step of determining the spacing between words as the step of 
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horizontal midpoint claimed; and wherein the left X coordinate corresponds to the 
horizontal end point claimed; Alam); and 

updating the column position information (Col. 10, lines 42-45, Alam 2 ) by 
performing a union operation between the column position information of a previous line 
(Col. 10, lines 26-29, Y coordinates of the previous lines, Alam) and the column 
position information of a current line (Col. 10, lines 24 - 27, Y coordinates of the 
selected lines, Alam). 

Regarding Claim 5, Alam discloses a method wherein receiving a page 
description language representation of the document for providing a list of words in the 
document and position information for the words includes receiving a PDF 
representation of the document (Col. 2, lines 1-5, portable document format (PDF), 
Alam), and wherein converting the table data encompassed by each table bounding box 
to a markup language representation includes converting the table data encompassed 
by each table bounding box to a HTML representation (Col. 6, lines 42 - 47, Alam). 

Regarding Claim 9, Alam discloses a computer-readable medium further 
containing instructions which, when executed by said processor, would cause said 
processor to perform the steps of: 

using the word clusters to generate column position information (Col. 9 and 18, 
lines 10-12 and 9-12; respectively, Alam), wherein the column information includes 



clustering the words claimed. 
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for each column a horizontal beginning point, horizontal midpoint, and horizontal end 
point (Col. 11, lines 8-17, wherejn the right most X coordinate corresponds to the 
horizontal beginning point claimed; wherein the center X coordinate corresponds to the 
horizontal midpoint claimed; and wherein the left X coordinate corresponds to the 
horizontal end point claimed; Alam); and 

updating the column position information ( Col. 10, lines 42 - 45, Alam 3 ) by 
performing a union operation between the column position information of a previous line 
(Col. 10, lines 26 - 29, Y coordinates of the previous lines, Alam) and the column 
position information of a current line (Col. 10, lines 24 - 27, Y coordinates of the 
selected lines, Alam). 

Regarding Claim 12, Alam discloses a document processing system comprising: 
a processor for executing programs (Fig. 2, item 151, Col. 4, lines 61 - 64, 
Alam); and 

a table identification program (Col. 4, lines 43 - 46, Alam) for receiving a page 
description language representation of a document (Col. 6, lines 24 - 28, Alam), the 
page description language representation providing a list of words in the document (Col. 
6, lines 57 - 59, words, Alam) and position information for the words (Col. 6 and 7, lines 
54 - 57 and 60 - 63; respectively, X and Y coordinates, Alam), and for automatically 
identifying table data in the document based on the page description language 

2 Wherein examiner interprets the step of determining that the line is not near the current paragraph as 
the step of updating position information claimed. 

3 Wherein examiner interprets the step of determining that the line is not near the current paragraph as 
the step of updating position information claimed. 
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representation of the document and at least one table identifying feature (Col. 7, lines 
26 - 28 and 32 - 35, Alam), wherein the identification program is configured to, 

dividing the document into one or more pages (Col. 15, lines 56 - 58, 
dividing into sub-pages, Alam); 

dividing each page into a plurality of lines (CoL 15, lines 57 - 60, Alam); 

for each line (Col. 8, lines 57 - 60, Alam), cluster the words of the line into 
one or more word clusters (Col. 9, lines 1-6, Alam 4 ), wherein each cluster 
includes one or more words (Col. 16, lines 15-22; if the block is a paragraph 
that ends with an incomplete sentence or an improper termination ... whether 
the block contains one or more sentences, Alam), each cluster having a 
horizontal beginning point, horizontal midpoint, and horizontal end point (CoL 11, 
lines 8-17, wherein the right most X coordinate corresponds to the horizontal 
beginning point claimed; wherein the center X coordinate corresponds to the 
horizontal midpoint claimed; and wherein the left X coordinate corresponds to the 
horizontal end point claimed; Alam); and 

automatically identify table data in the document based on the number of 
word clusters for each line and the alignment of the horizontal beginning points, 
horizontal midpoints, and horizontal end point of word clusters between lines 
(Col. 11, and 12, lines 8- 17, and 47-49 and 51 -52; respectively, Alam). 



4 Wherein examiner interprets the step of determining the spacing between words as the step of 
clustering the words claimed. 
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Regarding Claim 15, Alam discloses a document processing system of claim 13 
wherein the table identification program further comprises: 

a conversion module (Fig. 6, item 628, Col. 6, lines 41 - 45, converter, Alam) 
coupled to the bounding box generation module for receiving the table bounding box for 
each table in the document (Fig. 6, item 612, Col. 6, lines 35 - 40, ACROBAT 
CAPTURE and ACROBAT WRITER, Alam), and for converting the words encompassed 
by the table bounding box into a markup language representation that maintains the 
table structure of each table (Col. 6, lines 42 - 47, Alam). 

Regarding Claim 16, Alam discloses a method wherein the step of automatically 
identifying table data in the document based on the page description language 
representation of the document and at least one table identifying feature further 
comprises: 

automatically identifying table data in the document based on one or more table 
headings (Col. 2, lines 39 - 44, Alam). 

Regarding Claim 17, Alam discloses a method wherein the step of automatically 
identifying table data in the document based on the page description language 
representation of the document and at least one table identifying feature further 
comprises: 

automatically identifying table data in the document based on one or more 
horizontal lines (Fig 15B, Col. 14, lines 38 - 43, five rows, Alam) and vertical lines that 
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separate rows or columns of the table (Fig. 15C, Col. 14, lines 46 -49, two columns, 
Alam). 

Regarding Claim 18, Alam discloses a method, wherein the step of automatically 
identifying table data in the document based on the number of word clusters for each 
line and the alignment of the word clusters comprises: 

determining whether the number of word clusters in a line is greater than a 
threshold value (Col. 8, lines 21 - 26; If the inter-word spacing or distance in the Y 
direction is greater than a threshold...; Alam); and 

classifying the word clusters in the line as a row of a table in response to the 
number of word clusters in a line being greater than the threshold value (Col. 8, lines 28 
- 34; Alam). 

Regarding Claim 19, Alam discloses a computer-readable medium, wherein the 
instructions for automatically identifying table data in the document based on the 
number of word clusters for each line and the alignment of the word clusters include 
instructions that when executed by a processor cause the processor to perform the 
steps further comprising: 

determining whether the number of word clusters in a line is greater than a 
threshold value (Col. 8, lines 21 - 26; If the inter-word spacing or distance in the Y 
direction is greater than a threshold...; Alam); and 
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classifying the word clusters in the line as a row of a table in response to the 
number of word clusters in a line being greater than the threshold value (Col. 8, lines 28 

- 34; Alam). 

Regarding Claim 20, Alam discloses a document processing system, wherein the 
table identification program is further configured to: 

determine whether the number of word clusters in a line is greater than a 
threshold value (Col. 8, lines 21 - 26; If the inter-word spacing or distance in the Y 
direction is greater than a threshold...; Alam); and 

classify the word clusters in the line as a row of a table in response to the 
number of word clusters in a line being greater than the threshold value (Col. 8, lines 28 

- 34; Alam). 
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Response to Arguments 

1 . Applicant argues that the prior art fails to disclose; "handling left justifies, right 
justified or centered justified text" or "left justified, right justified or centered text is used 
in identifying table data". 

Examiner respectfully disagrees. The limitation including: handling left justified, 
right justified or centered justified text is not disclosed in the claim language. However, 
the applied prior art does disclose handling left justified, right justified or centered 
justified text (Col. 10, lines 60 - 63; left, right or center alignment; Alam); and left 
justified, right justified or centered text used in identifying table data (Col. 7, and 10, 
lines 5-9, and 60 - 63; respectively, Alam). 
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Prior Art Made Of Record 

1. Alam et al. (US Patent No. 6,336,124 B1 , filed: October 1, 1998) discloses 
conversion data representating a document to other formats for manipulation and 
display. 

2. Egger et al. (US Patent No. 6,233,571 B1) discloses a method and apparatus for 
indexing searching and displaying data. 
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Points Of Contact 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Giovanna Colan whose telephone number is (571) 272- 
2752. The examiner can normally be reached on 8:30 am - 5:00 pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, John Breene can be reached on (571) 272-4107. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



Giovanna Colan 
Examiner 
Art Unit 2162 
October 4, 2006 




